Python Notebook 6: Optimisation

Course on Statistical Machine Learning, University of Oxford. Author: F. Caron

Gradient descent demo

Without normalisation
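
The notebook's original cell is not reproduced here, so the following is only a minimal sketch of what this case might look like, assuming a least-squares objective and synthetic data with two features on very different scales. The data, the helper `gradient_descent`, and the choice of step size `1 / L` (with `L` the largest eigenvalue of the Hessian) are all illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 200

# Hypothetical synthetic data: the second feature has a scale 100x larger.
X = np.column_stack([rng.normal(0, 1, n), rng.normal(0, 100, n)])
beta_true = np.array([2.0, -0.03])
y = X @ beta_true + rng.normal(0, 0.1, n)


def gradient_descent(X, y, step, n_iter):
    """Plain gradient descent on the loss 0.5 * mean((X @ beta - y) ** 2)."""
    beta = np.zeros(X.shape[1])
    losses = []
    for _ in range(n_iter):
        residual = X @ beta - y
        losses.append(0.5 * np.mean(residual ** 2))
        beta = beta - step * X.T @ residual / len(y)
    return beta, np.array(losses)


# The Hessian of the loss is X^T X / n; its largest eigenvalue L bounds the
# usable step size, so the large-scale feature forces a very small step.
L = np.linalg.eigvalsh(X.T @ X / n).max()
beta_raw, losses_raw = gradient_descent(X, y, step=1.0 / L, n_iter=500)

print("step size:", 1.0 / L)
print("estimate after 500 iterations:", beta_raw)
```

With this setup the loss decreases at every iteration, but the coefficient of the small-scale feature moves very slowly, because the step size is dictated by the large-scale feature.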

With normalisation
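
As a hedged illustration of the effect of normalisation, the sketch below standardises each feature (zero mean, unit variance) before running gradient descent on a least-squares loss. The synthetic data, the centring of the response, and the step size rule are assumptions made for the example and are not taken from the notebook's original cell.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 200

# Hypothetical synthetic data with features on very different scales.
X = np.column_stack([rng.normal(0, 1, n), rng.normal(0, 100, n)])
y = X @ np.array([2.0, -0.03]) + rng.normal(0, 0.1, n)

# Standardise each column and centre the response.
mu, sigma = X.mean(axis=0), X.std(axis=0)
X_std = (X - mu) / sigma
y_centred = y - y.mean()

# Condition number of the Hessian before and after standardisation.
cond_raw = np.linalg.cond(X.T @ X / n)
cond_std = np.linalg.cond(X_std.T @ X_std / n)

# Gradient descent on the standardised problem.
step = 1.0 / np.linalg.eigvalsh(X_std.T @ X_std / n).max()
beta = np.zeros(2)
for _ in range(500):
    beta -= step * X_std.T @ (X_std @ beta - y_centred) / n

# Map the coefficients back to the original feature scale.
beta_original = beta / sigma

print("condition number (raw, standardised):", cond_raw, cond_std)
print("coefficients on the original scale:", beta_original)
```

Standardisation makes the Hessian well conditioned, so a much larger step size is admissible and the same number of iterations brings the estimate close to the least-squares solution.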

Stochastic gradient descent (batch size=1)
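
A minimal sketch of stochastic gradient descent with batch size 1, assuming a least-squares loss and synthetic data. The constant step size, the number of epochs, and the reshuffling of the data at each epoch are illustrative choices, not necessarily those of the original notebook.

```python
import numpy as np

rng = np.random.default_rng(1)
n = 500

# Hypothetical synthetic regression data.
X = rng.normal(size=(n, 2))
beta_true = np.array([1.0, -2.0])
y = X @ beta_true + rng.normal(0, 0.1, n)

beta = np.zeros(2)
step = 0.01      # constant step size (assumed)
n_epochs = 20

for epoch in range(n_epochs):
    # Visit the observations in a fresh random order at each epoch.
    for i in rng.permutation(n):
        # Gradient of 0.5 * (x_i . beta - y_i)^2 for a single observation.
        grad_i = (X[i] @ beta - y[i]) * X[i]
        beta -= step * grad_i

print("SGD estimate:", beta)
```

Each update uses a single observation, so the iterates are noisy; with a constant step size they fluctuate around the least-squares solution rather than converging to it exactly.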

Gradient descent, early stopping and implicit regularisation
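
The sketch below illustrates one common way to present this idea, under assumptions that may differ from the original notebook: an overparameterised least-squares problem (more features than observations), gradient descent initialised at zero, and a hypothetical held-out validation set used to choose the stopping iteration.

```python
import numpy as np

rng = np.random.default_rng(2)
n, p = 20, 50    # more parameters than observations (assumed setting)

# Hypothetical sparse ground truth and synthetic training/validation data.
beta_true = np.zeros(p)
beta_true[:5] = 1.0
X = rng.normal(size=(n, p))
y = X @ beta_true + rng.normal(0, 0.5, n)
X_val = rng.normal(size=(200, p))
y_val = X_val @ beta_true + rng.normal(0, 0.5, 200)

# Step size 1 / L, with L the largest eigenvalue of the Hessian X^T X / n
# (equal to the largest eigenvalue of X X^T / n).
step = 1.0 / np.linalg.eigvalsh(X @ X.T / n).max()
n_iter = 2000

beta = np.zeros(p)    # initialisation at zero matters for what follows
norms, val_losses = [], []
for t in range(n_iter):
    beta = beta - step * X.T @ (X @ beta - y) / n
    norms.append(np.linalg.norm(beta))
    val_losses.append(np.mean((X_val @ beta - y_val) ** 2))

# Early stopping: keep the iteration with the smallest validation loss.
best_t = int(np.argmin(val_losses))

# Minimum-norm least-squares solution, for comparison with the final iterate.
beta_min_norm = np.linalg.pinv(X) @ y

print("best iteration on the validation set:", best_t)
print("norm at best iteration vs final norm:", norms[best_t], norms[-1])
print("distance to min-norm solution:", np.linalg.norm(beta - beta_min_norm))
```

Started from zero, the norm of the iterates grows with the number of iterations, so stopping early yields a smaller-norm solution, similar in spirit to a ridge penalty. If run to convergence, gradient descent selects the minimum-norm interpolating solution among the infinitely many that fit the training data.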